Abstract

A very simple closed-form formula for Sheppard's corrections is recovered
by means of the classical umbral calculus. By means of this symbolic method,
a more general closed-form formula for discrete parent distributions is
provided, and the generalization to the multivariate case turns out to be
straightforward. All these new formulae are particularly well suited to
implementation in any symbolic package.

keywords: raw moment, grouped moment, Sheppard's correction, umbral 
calculus, Bernoulli polynomials 

1 Introduction 

In the real world, continuous variables are observed and recorded in finite 
precision through a rounding or coarsening operation, i.e. a grouping rule. 
A compromise between the desire to know and the cost of knowing is then 
a necessary consequence. The literature on grouped data spans different 
research areas, see for example [17]. In particular, attention has been paid 
in the literature to the computation of moments when data are grouped 
into classes. Indeed, the method of moments performs estimation less well
than maximum likelihood, but it is useful in situations where maximum
likelihood is not feasible or performs poorly for small sample sizes.

The moments computed by means of the resulting grouped frequency
distribution are regarded as a first approximation to the moments of the
parent distribution, but they suffer from the error committed in grouping.
The correction for grouping is a sum of two terms, the first depending on
the length of the grouping interval, the second being a periodic function of
the position. For a continuous random variable (r.v.), this very old problem
was first discussed by Thiele [27], who studied the second term and missed
the first, and then by Sheppard [26], who studied the first term and missed
the second. Both Bruns [4] and Fisher [15] proved that the second term can be
neglected under suitable hypotheses, which supports the use of Sheppard's
corrections, still employed nowadays. If $\tilde{a}_n$ denotes the n-th moment
of the grouped distribution, then the n-th raw moment $a_n$ of the continuous
parent distribution can be constructed via Sheppard's corrections as

$$ a_n = \sum_{j=0}^{n} \binom{n}{j} \left(2^{1-j} - 1\right) B_j \, h^j \, \tilde{a}_{n-j}, \tag{1} $$

where $\{B_j\}$ are the Bernoulli numbers^1 and $h$ is the length of the grouping
intervals. The derivation of Sheppard's corrections was a popular topic
in the first half of the last century, see [16] for a historical account. This is
because Sheppard deduced equation (1) by using a suitable summation formula
whose remainder term goes to zero when the density function has high-order
contact with the x-axis at both ends. So there was considerable controversy
over the set of sufficient conditions required in order to use formula
(1). All these sufficient conditions can be removed if the rounding lattice is
assumed to be random and the average is taken. Less research on moment
corrections has appeared since the definitive work of Kendall [18], although
recently the appropriateness of Sheppard's corrections was re-examined in
connection with some applications, see for example [6] and [28].
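
As a minimal sketch of how formula (1) can be evaluated exactly, the following Python fragment builds the Bernoulli numbers from the recurrence recalled in footnote 1 and applies the corrections to a list of grouped moments. The function names `bernoulli_numbers` and `sheppard_raw_moments`, and the convention that `grouped[k]` holds the k-th grouped moment with `grouped[0] = 1`, are assumptions made here for illustration; they do not come from the paper.

```python
from fractions import Fraction
from math import comb


def bernoulli_numbers(n_max):
    """Return [B_0, ..., B_n_max] with B_0 = 1 and B_1 = -1/2.

    Uses sum_{k=0}^{n} C(n+1, k) B_k = 0 for n >= 1 (footnote 1).
    """
    B = [Fraction(1)]
    for n in range(1, n_max + 1):
        partial = sum(comb(n + 1, k) * B[k] for k in range(n))
        B.append(-partial / (n + 1))
    return B


def sheppard_raw_moments(grouped, h):
    """Apply formula (1): raw moments from grouped moments.

    grouped[k] is the k-th grouped moment (grouped[0] must be 1),
    h is the width of the grouping intervals.
    """
    h = Fraction(h)
    n_max = len(grouped) - 1
    B = bernoulli_numbers(n_max)
    raw = []
    for n in range(n_max + 1):
        total = Fraction(0)
        for j in range(n + 1):
            coeff = (Fraction(2) ** (1 - j) - 1) * B[j]
            total += comb(n, j) * coeff * h**j * Fraction(grouped[n - j])
        raw.append(total)
    return raw


# Grouped moments 1, 0, 1, 0, 3 with unit class width (illustrative input).
print(sheppard_raw_moments([1, 0, 1, 0, 3], 1))
```

With these inputs the second corrected moment is 1 - 1/12, the classical variance correction. Exact rational arithmetic is used so that the coefficients can be compared directly with tabulated values.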

Grouping also includes censoring or splitting data into categories during
collection or publication, so it does not involve only continuous variables.
The derivation of corrections for raw moments of a discrete parent
distribution followed a different path from Sheppard's corrections. They
were first given in the Editorial of Vol. 1, no. 1, of the Annals of
Mathematical Statistics (page 111). The method used to develop the general
formula was extremely laborious. Some years later, Craig [5] considerably
shortened and simplified the derivation of these corrections by using the
logarithm of the moment generating function, that is, by working on cumulants
instead of moments. Craig proposed the same method to derive formula (1),
stating these corrections on the average and so avoiding the need for any
conditions on the parent distribution. At the moment, his method represents
the most general way to find such corrections, both for continuous and for
discrete parent distributions.

^1 Many characterizations of the Bernoulli numbers can be found in the literature and
each could be used to define these numbers. Here we refer to the sequence of numbers
such that $B_0 = 1$ and $\sum_{k=0}^{n} \binom{n+1}{k} B_k = 0$ for $n = 1, 2, \ldots$.

In this paper, we propose to go beyond Craig's method by stating Sheppard's
corrections on the average through the classical umbral calculus. This
approach is partially motivated by a similar use of the classical umbral
calculus in wavelet theory [23], [25]. Indeed, the reconstruction of the full
moment information $a_n$ in (1) through the grouped moments $\tilde{a}_n$ can be
seen as a kind of multilevel analysis, as in wavelet analysis. But the paper
is also inspired by the belief that the classical umbral calculus can be
fruitfully used in statistics, bringing out the authentically algebraic side
of many techniques commonly used to manage number sequences related to r.v.'s.

The umbral calculus was studied by Rota and his collaborators for a long
time [7]. The version introduced in [22] represents a new way of dealing with
number sequences, in which a sequence is represented by a symbol $\alpha$, called
an umbra. More precisely, a unital sequence $1, a_1, a_2, \ldots$ is associated to
the sequence $1, \alpha, \alpha^2, \ldots$ of powers of $\alpha$ through a linear functional $E$
that looks like the expectation of a r.v. So the classical umbral calculus is
no more than a symbolic tool for handling number sequences. An umbra looks
like the framework of a r.v. with no reference to any probability space, in
some way getting closer to statistical methods. Compared with previous symbolic
methods employed in statistics, see for example [1] and [20], from a theoretical
point of view it has the advantage of reducing the combinatorics of symmetric
functions, commonly used by statisticians, to a few relations which cover a
great variety of calculations [1^. From a computational point of view, the
efficiency of the umbral calculus in manipulating expressions involving r.v.'s
has been tested on the theory of k-statistics [10] and their generalizations
[12], as well as in manipulating U-statistics and product moments of sample
moments [11]. Recently, free cumulant theory has also been approached
by means of this new syntax [13], showing promise for future developments.

Finally, the use of the classical umbral calculus in corrections of
moments for grouped data has one more advantage. Except for the papers of
Craig [5] and Baten [3], no attention was paid to multivariate generalizations
of Sheppard's corrections, probably due to the complexity of the resulting
formulae. The notion of multiset is the combinatorial device that, via the
symbolic method, gives rise to a closed-form formula for multivariate parent
distributions that could be implemented in a few steps by using any symbolic
package.

The paper is structured as follows. Section 2 is provided for readers
unfamiliar with the classical umbral calculus. Let us underline that the theory
of the classical umbral calculus has now reached a more advanced level
compared to the elements summarized here. We have chosen to recall the
terminology, notation and basic definitions strictly necessary to deal with the
object of this paper. In particular, we recall the notion of the Bernoulli
umbra, as introduced in [22]. The Bernoulli umbra is the keystone for the umbral
handling of moment corrections of grouped data. In Section 3, Sheppard's
corrections are given and extended to discrete parent distributions. For the
continuous case, the key is to represent integrals by means of suitable umbral
Bernoulli polynomials. For the discrete case, the key is the generalization of
the so-called multiplication theorem to umbral Bernoulli polynomials. Unlike
the literature on this subject, where first the expressions of corrected
moments in terms of raw ones are deduced and then these expressions are
inverted by solving a linear system, here we deduce the corrections on the
moments directly, thanks to closed-form formulae. Section 4 is devoted to the
multivariate generalizations of Sheppard's corrections. Some concluding
remarks end the paper.

2 Background on umbral calculus 

In the following, terminology, notation and some basic definitions of the
classical umbral calculus are recalled, as introduced by Rota and Taylor in
[22] and subsequently developed by Di Nardo and Senato in [8] and [9]. We
skip all proofs: the reader interested in an in-depth analysis is referred to
the papers quoted above.

The classical umbral calculus is a syntax consisting of the following data:

i) a set $A = \{\alpha, \beta, \ldots\}$, called the alphabet, whose elements are named
umbrae;

ii) the polynomial ring $\mathcal{R} = \mathbb{R}[x]$ in the indeterminate $x$, with $\mathbb{R}$ the field of
real numbers^2;

iii) a linear functional $E : \mathcal{R}[A] \to \mathcal{R}$, called evaluation, such that $E[1] = 1$
and $E[x^n \alpha^i \beta^j \cdots \gamma^k] = x^n E[\alpha^i] E[\beta^j] \cdots E[\gamma^k]$ for any set of distinct
umbrae in $A$ and for $n, i, j, \ldots, k$ nonnegative integers (the so-called
uncorrelation property);

iv) an element $\varepsilon \in A$, called augmentation, such that $E[\varepsilon^n] = \delta_{0,n}$ for any
nonnegative integer $n$, where $\delta_{i,j} = 1$ if $i = j$, otherwise being zero;

v) an element $u \in A$, called unity umbra, such that $E[u^n] = 1$ for any
nonnegative integer $n$.

^2 For the aim of this paper, we need something more than the usual commutative integral
domain whose quotient field is of characteristic zero, required in [25]. Given the framework
we deal with, it is more convenient to refer directly to the field of real numbers $\mathbb{R}$.

A sequence $a_0 = 1, a_1, a_2, \ldots$ in $\mathcal{R}$ is umbrally represented by an umbra
$\alpha$ when

$$ E[\alpha^i] = a_i, \quad \text{for } i = 0, 1, 2, \ldots. $$

The elements $a_i$ are called moments of the umbra $\alpha$, in analogy with the
moments of a r.v. $X$. In particular, an umbra is said to be scalar if the
moments are elements of $\mathbb{R}$, while it is said to be polynomial if the moments
are polynomials of $\mathcal{R}$.

An umbral polynomial is a polynomial $p \in \mathcal{R}[A]$. The support of $p$ is the
set of all umbrae occurring in $p$. If $p$ and $q$ are two umbral polynomials, then
$p$ and $q$ are uncorrelated if and only if their supports are disjoint. We say
that $p$ and $q$ are umbrally equivalent if and only if

$$ E[p] = E[q], \quad \text{in symbols } p \simeq q. $$

It is possible that two distinct umbrae represent the same sequence of
moments; in such a case they are called similar umbrae. More formally, two
umbrae $\alpha$ and $\gamma$ are similar when $\alpha^n$ is umbrally equivalent to $\gamma^n$ for all
$n = 0, 1, 2, \ldots$, in symbols

$$ \alpha \equiv \gamma \iff \alpha^n \simeq \gamma^n, \quad n = 0, 1, 2, \ldots. $$

Given a sequence $1, a_1, a_2, \ldots$ in $\mathcal{R}$, there are infinitely many distinct, and
thus similar, umbrae representing the sequence.

Two umbrae $\alpha$ and $\gamma$ are said to be inverse to each other when
$\alpha + \gamma \equiv \varepsilon$. We denote the inverse of the umbra $\alpha$ by $-1.\alpha$. Note that they are
uncorrelated. Recall that, in dealing with a saturated^3 umbral calculus, the
inverse of an umbra is not unique, but any two umbrae inverse to any given
umbra are similar.
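
The definitions above can be mimicked numerically by identifying a scalar umbra with its (truncated) list of moments. The sketch below is only an illustration under that assumption: `sum_moments` computes the moments of the sum of two uncorrelated umbrae by the binomial expansion, and `inverse_moments` solves the defining relation of the inverse umbra, namely that its sum with the given umbra is similar to the augmentation. Both function names are hypothetical.

```python
from fractions import Fraction
from math import comb


def sum_moments(a, b):
    """Moments of alpha + beta for uncorrelated umbrae with moment lists a, b."""
    n_max = min(len(a), len(b)) - 1
    return [
        sum(comb(n, k) * a[k] * b[n - k] for k in range(n + 1))
        for n in range(n_max + 1)
    ]


def inverse_moments(a):
    """Moments of -1.alpha: the sequence g with alpha + gamma similar to epsilon."""
    g = [Fraction(1)]
    for n in range(1, len(a)):
        # E[(alpha + gamma)^n] = 0 for n >= 1, solved for g[n] (a[0] = 1).
        g.append(-sum(comb(n, k) * a[k] * g[n - k] for k in range(1, n + 1)))
    return g


N = 6
augmentation = [Fraction(int(n == 0)) for n in range(N)]  # E[eps^n] = delta_{0,n}
unity = [Fraction(1)] * N                                  # E[u^n] = 1

# The inverse of the unity umbra has moments (-1)^n, and u + (-1.u) gives epsilon.
print(inverse_moments(unity))
print(sum_moments(unity, inverse_moments(unity)) == augmentation)
```

Working with moment lists loses the distinction between similar umbrae, which is harmless here because only evaluations are compared.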

The Bernoulli umbra. A definition of the Bernoulli umbra $\iota$ is given
in [22]: up to similarity, the Bernoulli umbra is the unique umbra such that

$$ (\iota + 1)^{n+1} \simeq \iota^{n+1} \tag{2} $$

for all positive integers $n$. Then, the Bernoulli umbra $\iota$ turns out to be the
unique (up to similarity) umbra such that $E[\iota^n] = B_n$ for $n = 0, 1, 2, \ldots$,
where $\{B_n\}$ are the Bernoulli numbers. By using equivalence (2), the main
properties of the Bernoulli numbers can be easily proved, as for example
$B_{2n+1} = 0$ for all positive integers $n$. Here, we just recall that the
Bernoulli polynomials $\{B_n(x)\}$ are such that

$$ B_n(x) \simeq (x + \iota)^n. \tag{3} $$

^3 Roughly speaking, a saturated umbral calculus is defined when the alphabet $A$ is
extended with a set including all auxiliary umbrae like $-1.\alpha$. In [22], a formal definition
of saturated umbral calculus is given.



The Bernoulli polynomials are characterized by having an average value of
0 over the interval [0, 1] for all positive integers $n$, that is

$$ \int_0^1 B_n(x)\,dx = 0. \tag{4} $$

We sketch a simple "umbral" proof. Since by simple computations we have
$\int_0^1 B_n(x)\,dx = E\left[\int_0^1 (x + \iota)^n\,dx\right]$, then
$\int_0^1 B_n(x)\,dx \simeq \int_0^1 (x + \iota)^n\,dx \simeq \frac{1}{n+1}\left[(\iota + 1)^{n+1} - \iota^{n+1}\right] \simeq 0$,
as $(\iota + 1)^{n+1} - \iota^{n+1} \simeq 0$ due to equivalence (2).
The approach introduced here allows us to manage integrals by using
suitable umbral polynomials.
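
The two characterizations (2) and (3) translate directly into a computation, sketched below with exact rationals. Equivalence (2) is expanded into a linear recurrence for the moments of the Bernoulli umbra, and equivalence (3) gives the coefficients of the Bernoulli polynomials; the helper names `bernoulli_numbers`, `bernoulli_poly` and `integral_01` are assumptions made for this illustration.

```python
from fractions import Fraction
from math import comb


def bernoulli_numbers(n_max):
    """Moments E[iota^n] = B_n of the Bernoulli umbra, for n = 0..n_max."""
    B = [Fraction(1)]
    for n in range(1, n_max + 1):
        # (iota + 1)^(n+1) ~ iota^(n+1)  =>  sum_{k=0}^{n} C(n+1, k) B_k = 0
        B.append(-sum(comb(n + 1, k) * B[k] for k in range(n)) / (n + 1))
    return B


def bernoulli_poly(n, B):
    """Coefficients c[0..n] of B_n(x) = sum_k c[k] x^k, from B_n(x) ~ (x + iota)^n."""
    return [comb(n, k) * B[n - k] for k in range(n + 1)]


def integral_01(coeffs):
    """Exact integral over [0, 1] of the polynomial with the given coefficients."""
    return sum(Fraction(c) / (k + 1) for k, c in enumerate(coeffs))


B = bernoulli_numbers(8)
print(bernoulli_poly(2, B))                                   # x^2 - x + 1/6
print([integral_01(bernoulli_poly(n, B)) for n in range(1, 9)])  # check of (4)
```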

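Theorem 2.1. If $-1.\iota$ is the inverse of the Bernoulli umbra and $\{B_n(x)\}$
are the Bernoulli polynomials, then

$$ E[B_n(-1.\iota)] = \int_0^1 B_n(x)\,dx \tag{5} $$

for all nonnegative integers $n$.

Proof. Via equivalence (3), we have $B_n(-1.\iota) \simeq (-1.\iota + \iota)^n \simeq \varepsilon^n$, whose
evaluation $\delta_{0,n}$ coincides with the right-hand side of (5) by (4) and $B_0(x) = 1$. □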
Corollary 2.2. If $p(x) \in \mathcal{R}$, $h \in \mathbb{R} \setminus \{0\}$ and $c \in \mathbb{R}$, then

$$ E[p(-1.\iota)] = \int_0^1 p(u)\,du, \qquad E[p(-1.(h\iota) + c)] = \frac{1}{h} \int_c^{c+h} p(t)\,dt. \tag{6} $$

Proof. Without loss of generality, assume $p(x)$ is a polynomial of degree $n$, for
some nonnegative integer $n$, that is $p(x) = \sum_{k=0}^{n} a_k x^k$. By recalling that
$E[(-1.\iota)^k] = 1/(k+1)$ for all nonnegative integers $k$ [22], we have

$$ E[p(-1.\iota)] = \sum_{k=0}^{n} \frac{a_k}{k+1} = \sum_{k=0}^{n} a_k \int_0^1 x^k\,dx = \int_0^1 p(x)\,dx. $$

The latter of equations (6) follows from the former. Indeed, in $\frac{1}{h}\int_c^{c+h} p(t)\,dt$
replace $t$ by $hu + c$. The result follows by recalling the similarity $-1.(h\alpha) \equiv h(-1.\alpha)$,
proved in [H] for any umbra $\alpha \in A$. □
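
Corollary 2.2 can be checked numerically for any given polynomial. In the sketch below, `expect_shifted` evaluates the left-hand side of the second identity in (6) by expanding $p(hu + c)$ and replacing each power $u^k$ with the moment $1/(k+1)$ of the inverse Bernoulli umbra, while `interval_average` computes the right-hand side from the antiderivative. The function names and the coefficient-list representation of $p$ are assumptions of this illustration.

```python
from fractions import Fraction
from math import comb


def expect_shifted(p, h, c):
    """E[p(-1.(h iota) + c)] for p(t) = sum_n p[n] t^n.

    Uses -1.(h iota) similar to h(-1.iota) and E[(-1.iota)^k] = 1/(k+1).
    """
    h, c = Fraction(h), Fraction(c)
    total = Fraction(0)
    for n, coeff in enumerate(p):
        for k in range(n + 1):
            total += Fraction(coeff) * comb(n, k) * h**k * c ** (n - k) / (k + 1)
    return total


def interval_average(p, h, c):
    """(1/h) times the integral of p over [c, c + h], via the antiderivative."""
    h, c = Fraction(h), Fraction(c)
    upper = sum(Fraction(a) * (c + h) ** (n + 1) / (n + 1) for n, a in enumerate(p))
    lower = sum(Fraction(a) * c ** (n + 1) / (n + 1) for n, a in enumerate(p))
    return (upper - lower) / h


p = [1, -2, 0, 5]  # p(t) = 1 - 2t + 5t^3, an arbitrary test polynomial
print(expect_shifted(p, Fraction(3, 2), Fraction(-1, 4)))
print(interval_average(p, Fraction(3, 2), Fraction(-1, 4)))
```

Taking $h = 1$ and $c = 0$ recovers the first identity in (6).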

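In the following, we give an umbral proof of the so-called multiplication
theorem [21] for the umbral Bernoulli polynomials. We will use this property
in the next section.

Theorem 2.3. If $m$ is a positive integer, then for all nonnegative integers $n$

$$ \left(x + \frac{\iota}{m}\right)^n \simeq \frac{1}{m} \sum_{k=0}^{m-1} \left(x + \frac{k}{m} + \iota\right)^n. \tag{7} $$

Proof. Let $\iota'$ be a Bernoulli umbra uncorrelated with $\iota$. Since $-1.\iota + \iota \equiv \varepsilon$, we have

$$ \left(x + \frac{\iota'}{m}\right)^n \simeq \left(-1.\iota + x + \frac{\iota'}{m} + \iota\right)^n \simeq \sum_{j=0}^{n} \binom{n}{j} \iota^{n-j} \left(-1.\iota + x + \frac{\iota'}{m}\right)^j. \tag{8} $$

By using the former of equations (6), we have

$$ \left(-1.\iota + x + \frac{\iota'}{m}\right)^j \simeq \int_0^1 \left(u + x + \frac{\iota'}{m}\right)^j du \simeq \sum_{k=0}^{m-1} \int_{k/m}^{(k+1)/m} \left(u + x + \frac{\iota'}{m}\right)^j du. $$

If we set $y = mu - k$, then

$$ \sum_{k=0}^{m-1} \int_{k/m}^{(k+1)/m} \left(u + x + \frac{\iota'}{m}\right)^j du = \frac{1}{m} \sum_{k=0}^{m-1} \int_0^1 \left(x + \frac{k}{m} + \frac{y + \iota'}{m}\right)^j dy \simeq \frac{1}{m} \sum_{k=0}^{m-1} \left(x + \frac{k}{m}\right)^j, $$

where the last equivalence uses the former of equations (6) again, together with $-1.\iota' + \iota' \equiv \varepsilon$.
The result follows by substituting these last two equivalences in (8) and recalling that $\iota' \equiv \iota$. □

By linearity, if $p(x) \in \mathcal{R}$, then $p\left(x + \frac{\iota}{m}\right) \simeq \frac{1}{m} \sum_{k=0}^{m-1} p\left(x + \frac{k}{m} + \iota\right)$.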
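
Equivalence (7) can be verified for specific values by evaluating both sides with the Bernoulli numbers as moments of $\iota$. The following sketch does this in exact arithmetic; the names `lhs` and `rhs`, and the chosen test point, are illustrative assumptions only.

```python
from fractions import Fraction
from math import comb


def bernoulli_numbers(n_max):
    """B_0..B_n_max with B_1 = -1/2, from sum_{k=0}^{n} C(n+1, k) B_k = 0."""
    B = [Fraction(1)]
    for n in range(1, n_max + 1):
        B.append(-sum(comb(n + 1, k) * B[k] for k in range(n)) / (n + 1))
    return B


def lhs(n, x, m, B):
    """E[(x + iota/m)^n], expanded with the binomial theorem."""
    return sum(comb(n, k) * x ** (n - k) * B[k] / Fraction(m) ** k for k in range(n + 1))


def rhs(n, x, m, B):
    """(1/m) * sum_{k=0}^{m-1} E[(x + k/m + iota)^n]."""
    total = Fraction(0)
    for k in range(m):
        y = x + Fraction(k, m)
        total += sum(comb(n, i) * y ** (n - i) * B[i] for i in range(n + 1))
    return total / m


B = bernoulli_numbers(6)
x = Fraction(3, 7)
print(all(lhs(n, x, m, B) == rhs(n, x, m, B) for n in range(7) for m in range(1, 6)))
```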

3 Corrections to grouped moments 

Usually, the n-th moment of the grouped distribution is represented by

$$ \tilde{a}'_n = \sum_i \xi_i^n \, P\left(\xi_i - \frac{h}{2} < X < \xi_i + \frac{h}{2}\right), \tag{9} $$

where $P(\cdot)$ denotes the parent distribution, $\xi_i$ are the mid-points of the classes
partitioning the range and $h$ is the width of each class. In practice, we know only
an estimate of $\tilde{a}'_n$, since when $\tilde{a}'_n$ is computed, the probability

$$ P\left(\xi_i - \frac{h}{2} < X < \xi_i + \frac{h}{2}\right) $$

is replaced by the frequency $N_i/N$ of the corresponding class and only a
finite number of classes is considered (so the summation is over a finite
number of terms). In establishing approximate relations between the set of
the raw moments $a_n$ and the set of grouped moments $\tilde{a}'_n$, a way to avoid any
assumption other than the existence of the involved moments is to employ
another set of constants $\tilde{a}_n$. These constants are the average of $\tilde{a}'_n$ when
the set $\{\xi_i\}$ is replaced by a suitable set of random points obtained by
assuming the rounding lattice to be random. By the umbral method, we will
prove that the expression of the raw moments $a_n$ in terms of the constants
$\tilde{a}_n$ gives the corrections to grouped moments. The discussion on the nature
of the approximation in replacing $\tilde{a}_n$ by $\tilde{a}'_n$ goes beyond the aim of this paper,
and it has already been tackled in the literature, see for example [2], [17] and
[29].

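Continuous parent distribution: Sheppard's corrections. Let $f$
be a continuous probability density function of a r.v. $X$ over $(-\infty, \infty)$. As
usual,

$$ a_n = \int_{-\infty}^{\infty} x^n f(x)\,dx \tag{10} $$

denotes the n-th moment of $X$ about the origin. In the following, we assume
that all absolute moments exist. The moments calculated from the grouped
frequencies are given by

$$ \tilde{a}_n = \frac{1}{h} \int_{-\infty}^{\infty} \int_{-h/2}^{h/2} (x + z)^n f(x)\,dz\,dx. \tag{11} $$

Indeed, in (9), suppose we replace $\xi_i$ by $\xi_i + U$, with $U$ a uniform r.v. over
$\left(-\frac{1}{2}h, \frac{1}{2}h\right)$. Equation (11) follows by setting $\tilde{a}_n = E[\tilde{a}'_n(U)]$.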

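Theorem 3.1 (Sheppard's correction). If the sequence $\{\tilde{a}_n\}$ in (11) is umbrally
represented by the umbra $\tilde{\alpha}$ and the sequence $\{a_n\}$ in (10) is umbrally
represented by the umbra $\alpha$, then

$$ \tilde{\alpha} \equiv \alpha + h\left(-1.\iota - \frac{1}{2}\right). \tag{12} $$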
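Proof. We have

$$ E[(\alpha + x)^n] = \sum_{k=0}^{n} \binom{n}{k} E[\alpha^k]\, x^{n-k} = \int_{-\infty}^{\infty} \sum_{k=0}^{n} \binom{n}{k} t^k x^{n-k} f(t)\,dt, $$

so that $(\alpha + x)^n \simeq \int_{-\infty}^{\infty} (t + x)^n f(t)\,dt$ for all nonnegative integers $n$. Since

$$ \tilde{a}_n = \frac{1}{h} \int_{-\infty}^{\infty} \int_{-h/2}^{h/2} (t + x)^n f(t)\,dx\,dt = \frac{1}{h} \int_{-h/2}^{h/2} E[(\alpha + x)^n]\,dx $$

and, due to the linearity of $E$,

$$ \frac{1}{h} \int_{-h/2}^{h/2} E[(\alpha + x)^n]\,dx = E\left[\frac{1}{h} \int_{-h/2}^{h/2} (\alpha + x)^n\,dx\right], $$

equivalence (12) follows by observing that

$$ \frac{1}{h} \int_{-h/2}^{h/2} (\alpha + x)^n\,dx \simeq \left[-1.(h\iota) + \alpha - \frac{h}{2}\right]^n, $$

where the last equivalence follows from the latter of (6) with $c = -h/2$. □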

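As a first corollary, we get equation (1). Note that from equivalence
(12), we have

$$ \alpha \equiv \tilde{\alpha} + h\left(\iota + \frac{1}{2}\right). \tag{13} $$

Then, by using the binomial expansion and by applying the linear functional
$E$, for all nonnegative integers $n$ we have

$$ a_n = \sum_{j=0}^{n} \binom{n}{j} h^j \, E\left[\left(\iota + \frac{1}{2}\right)^j\right] \tilde{a}_{n-j}. $$

In order to recover equation (1), all we need is to prove that
$E\left[\left(\iota + \frac{1}{2}\right)^j\right] = B_j \left(2^{1-j} - 1\right)$ for all nonnegative $j$. This is done in the following proposition.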



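Proposition 3.2. If $\iota$ is the Bernoulli umbra, then for all nonnegative integers $j$

$$ E\left[\left(\iota + \frac{1}{2}\right)^j\right] = \left(2^{1-j} - 1\right) B_j. \tag{14} $$

Proof. Note that for all nonnegative integers $j$ we have

$$ \left(\iota + \frac{1}{2}\right)^j \simeq \left(-1.\frac{\iota'}{2} + \frac{\iota'}{2} + \iota + \frac{1}{2}\right)^j \simeq \sum_{k=0}^{j} \binom{j}{k} \left(\frac{\iota'}{2} + \iota\right)^{j-k} \left(-1.\frac{\iota'}{2} + \frac{1}{2}\right)^k, \tag{15} $$

where $\iota$ and $\iota'$ denote uncorrelated Bernoulli umbrae. For all nonnegative
integers $k$, from equation (6) we have

$$ \left(-1.\frac{\iota'}{2} + \frac{1}{2}\right)^k \simeq \frac{1}{2^k} \int_0^1 (x + 1)^k\,dx = \frac{1}{2^k} \left[\frac{2^{k+1}}{k+1} - \frac{1}{k+1}\right] \simeq \frac{1}{2^k} \left[2^{k+1} (-1.\iota)^k - (-1.\iota')^k\right]. $$

Substituting this last equivalence in (15), we have

$$ \left(\iota + \frac{1}{2}\right)^j \simeq \sum_{k=0}^{j} \binom{j}{k} \left(\frac{\iota'}{2} + \iota\right)^{j-k} 2\,(-1.\iota)^k - \sum_{k=0}^{j} \binom{j}{k} \left(\frac{\iota'}{2} + \iota\right)^{j-k} \left(-1.\frac{\iota'}{2}\right)^k $$

$$ \simeq 2 \left(-1.\iota + \iota + \frac{\iota'}{2}\right)^j - \left(-1.\frac{\iota'}{2} + \frac{\iota'}{2} + \iota\right)^j \simeq 2 \left(\frac{\iota'}{2}\right)^j - \iota^j, $$

by which equivalence (14) follows. □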
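
As a quick sanity check of the moments of $\iota + \frac{1}{2}$ that enter equation (1), the case $j = 2$ can be worked by hand. The computation below uses only the standard values $B_1 = -\frac{1}{2}$ and $B_2 = \frac{1}{6}$; it is an illustration, not part of the original proof.

```latex
\begin{align*}
E\left[\left(\iota + \tfrac{1}{2}\right)^2\right]
  &= E[\iota^2] + E[\iota] + \tfrac{1}{4}
   = B_2 + B_1 + \tfrac{1}{4}
   = \tfrac{1}{6} - \tfrac{1}{2} + \tfrac{1}{4}
   = -\tfrac{1}{12}, \\
\left(2^{1-2} - 1\right) B_2
  &= -\tfrac{1}{2} \cdot \tfrac{1}{6}
   = -\tfrac{1}{12}.
\end{align*}
```

Both sides agree, and multiplying by $h^2$ gives the familiar correction $-h^2/12$ to the second moment.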



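As a second corollary of Theorem 3.1, we recover the equations giving the n-th
moment of grouped data in terms of raw moments. Indeed, in equivalence
(12), by using the binomial expansion and by applying the linear functional
$E$, we have

$$ \tilde{a}_n = \sum_{j=0}^{n} \binom{n}{j} a_{n-j}\, h^j\, E\left[\left(-1.\iota - \frac{1}{2}\right)^j\right] \tag{16} $$

for all nonnegative integers $n$. As before, we need to evaluate the moments
of $-1.\iota - \frac{1}{2}$. This can be done by using only equation (6). Indeed, for all
nonnegative integers $j$ we have

$$ E\left[\left(-1.\iota - \frac{1}{2}\right)^j\right] = \int_0^1 \left(x - \frac{1}{2}\right)^j dx = \begin{cases} 0, & \text{if } j \text{ is odd}, \\ \dfrac{1}{2^j (j+1)}, & \text{if } j \text{ is even}. \end{cases} \tag{17} $$

So from equation (16) and by using (17), we have

$$ \tilde{a}_n = \sum_{j=0}^{\lfloor n/2 \rfloor} \binom{n}{2j} \frac{h^{2j}}{2^{2j} (2j+1)}\, a_{n-2j}. \tag{18} $$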
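
Equations (1) and (18) are inverse to each other, which gives a convenient consistency check. The sketch below maps a list of raw moments to grouped moments with (18) and then back with (1); the function names and the sample moment list (1, 0, 1, 0, 3) are assumptions chosen for illustration.

```python
from fractions import Fraction
from math import comb


def bernoulli_numbers(n_max):
    """B_0..B_n_max with B_1 = -1/2."""
    B = [Fraction(1)]
    for n in range(1, n_max + 1):
        B.append(-sum(comb(n + 1, k) * B[k] for k in range(n)) / (n + 1))
    return B


def grouped_from_raw(raw, h):
    """Equation (18): averaged grouped moments from raw moments."""
    h = Fraction(h)
    return [
        sum(
            comb(n, 2 * j) * h ** (2 * j) / (2 ** (2 * j) * (2 * j + 1)) * Fraction(raw[n - 2 * j])
            for j in range(n // 2 + 1)
        )
        for n in range(len(raw))
    ]


def raw_from_grouped(grouped, h):
    """Equation (1): Sheppard's corrections."""
    h = Fraction(h)
    B = bernoulli_numbers(len(grouped) - 1)
    return [
        sum(
            comb(n, j) * (Fraction(2) ** (1 - j) - 1) * B[j] * h**j * grouped[n - j]
            for j in range(n + 1)
        )
        for n in range(len(grouped))
    ]


raw = [1, 0, 1, 0, 3]
h = Fraction(1, 2)
grouped = grouped_from_raw(raw, h)
print(grouped)
print(raw_from_grouped(grouped, h) == raw)
```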
Remark 3.3. Theorem 3.1 still holds when the moments are referred to a
parent distribution over $(a, b)$. In (10) and (11), instead of using the domain
of integration $(-\infty, \infty)$, we integrate over $(a, b)$, and we do the same in the
proof of Theorem 3.1. In (9), we refer the summation to $i = 1, \ldots, p$, with
$p$ the number of classes of width $h$ partitioning $(a, b)$. Here the r.v. $\tilde{a}'_n(U)$
is obtained by replacing $\xi_i$ with $a + \left(i - \frac{1}{2}\right) h + U$.

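Discrete parent distribution. Assume that $m$ equidistant consecutive
values of a discrete r.v. are grouped into a frequency class of width $h$.
The $m$ smaller intervals of width $h/m$ make up the class width $h$ in
such a way that the $m$ values of the variable represent the mid-points of the
sub-intervals. Without loss of generality, we assume that

$$ P\left(X = \frac{ih}{m}\right) = \frac{h}{m} f\left(\frac{ih}{m}\right) $$

with

$$ f\left(\frac{ih}{m}\right) \geq 0 \quad \forall i \in \mathbb{Z} \qquad \text{and} \qquad \sum_i \frac{h}{m} f\left(\frac{ih}{m}\right) = 1. $$

In (9), replace $\xi_i$ by

$$ \left(mi + U - \frac{m-1}{2}\right) \frac{h}{m} $$

with $U$ a r.v. taking each of the $m$ possible values $0, 1, \ldots, m-1$ with equal
probability. The $m$ values of $\tilde{a}'_n(U)$ corresponding to the $m$ distinct methods
of grouping a discrete distribution are

$$ \tilde{a}'_n(U) = \sum_i \left[\left(mi + U - \frac{m-1}{2}\right) \frac{h}{m}\right]^n \sum_{j=0}^{m-1} \frac{h}{m} f\left((mi + U - j) \frac{h}{m}\right). $$

By doing some calculations, we recover the expression of the moments calculated
from the grouped frequencies

$$ \tilde{a}_n = E[\tilde{a}'_n(U)] = \frac{1}{m} \sum_{j=0}^{m-1} \frac{h}{m} \sum_s \left(\frac{sh}{m} + \frac{h(m - 1 - 2j)}{2m}\right)^n f\left(\frac{sh}{m}\right). \tag{19} $$

In the following, we denote by $a_n$ the n-th moment of the discrete parent
distribution, that is

$$ a_n = \frac{h}{m} \sum_s \left(\frac{sh}{m}\right)^n f\left(\frac{sh}{m}\right). \tag{20} $$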
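
The "some calculations" leading to (19) can be checked by brute force on a small example. In the sketch below a discrete parent distribution is stored as a dictionary `pmf` mapping the lattice index $s$ to the probability $\frac{h}{m} f\left(\frac{sh}{m}\right)$; this representation, the function names and the sample distribution are assumptions made for illustration. The first function averages the class-mid-point moment (9) over the $m$ lattice positions, the second evaluates the right-hand side of (19).

```python
from fractions import Fraction


def grouped_moment_bruteforce(pmf, m, h, n):
    """Average over U = 0..m-1 of the mid-point moment (9) for a discrete parent."""
    h = Fraction(h)
    total = Fraction(0)
    for U in range(m):
        for s, prob in pmf.items():
            # Class i collects the indices m*i + U - j, j = 0..m-1, so the class
            # containing s is the smallest i with m*i + U >= s.
            i = -((U - s) // m)
            xi = (m * i + U - Fraction(m - 1, 2)) * h / m  # class mid-point
            total += xi**n * prob
    return total / m


def grouped_moment_formula(pmf, m, h, n):
    """Right-hand side of (19), with pmf[s] = (h/m) f(s h / m)."""
    h = Fraction(h)
    total = Fraction(0)
    for j in range(m):
        shift = h * (m - 1 - 2 * j) / (2 * m)
        for s, prob in pmf.items():
            total += (s * h / m + shift) ** n * prob
    return total / m


pmf = {0: Fraction(1, 10), 1: Fraction(2, 10), 2: Fraction(3, 10), 4: Fraction(4, 10)}
m, h = 3, Fraction(3, 2)
print(all(
    grouped_moment_bruteforce(pmf, m, h, n) == grouped_moment_formula(pmf, m, h, n)
    for n in range(6)
))
```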
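Theorem 3.4 (Corrections). If the sequence $\{\tilde{a}_n\}$ in (19) is umbrally
represented by the umbra $\tilde{\alpha}$ and the sequence $\{a_n\}$ in (20) is umbrally represented
by the umbra $\alpha$, then

$$ \tilde{\alpha} \equiv \alpha + h\left(-1.\iota - \frac{1}{2}\right) + \frac{h}{m}\left(\iota + \frac{1}{2}\right). \tag{21} $$

Proof. By linearity, we have

$$ E\left[\left(\alpha + \frac{h(m - 1 - 2j)}{2m}\right)^n\right] = \frac{h}{m} \sum_s \left(\frac{sh}{m} + \frac{h(m - 1 - 2j)}{2m}\right)^n f\left(\frac{sh}{m}\right). $$

The result follows by observing that

$$ \frac{1}{m} \sum_{j=0}^{m-1} \left(\alpha + \frac{h(m - 1 - 2j)}{2m}\right)^n = \frac{h^n}{m} \sum_{j=0}^{m-1} \left(\frac{\alpha}{h} - \frac{m-1}{2m} + \frac{j}{m}\right)^n \simeq h^n \left(-1.\iota + \frac{\alpha}{h} - \frac{m-1}{2m} + \frac{\iota}{m}\right)^n, \tag{22} $$

where the first equality is the reindexing $j \mapsto m - 1 - j$ and equivalence (22) follows
from equivalence (7), by replacing $x$ with $-1.\iota + \frac{\alpha}{h} - \frac{m-1}{2m}$. Suitably
rearranging the terms, we obtain equivalence (21). □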



It is interesting to compare equivalence (12) with (21). In this last
equivalence, we find just one more addend, which is an umbra whose moments
are given in Proposition 3.2. Obviously, if $m \to \infty$, from (21) we recover
(12), as stated by Craig in [5]. Moreover, since from (21) we find

$$ \alpha \equiv \tilde{\alpha} + h\left(\iota + \frac{1}{2}\right) + \frac{h}{m}\left(-1.\iota - \frac{1}{2}\right), \tag{23} $$

equivalence (23) differs from equivalence (13) by an umbra whose moments
are given in (17). These observations turn out to be useful when we seek an
expression of the raw moments $\{a_n\}$ in terms of the grouped moments $\{\tilde{a}_n\}$. To the
best of our knowledge, the latest version available is given by Craig in [5], but
its structure is quite complex and involves integer partitions. Here we give
a different expression. By applying the binomial expansion and the linear
functional $E$ to equivalence (23), for all nonnegative integers $n$ we have

$$ a_n = \sum_{k=0}^{n} \binom{n}{k} \left(\frac{h}{m}\right)^k E\left[\left(\tilde{\alpha} + h\left(\iota + \frac{1}{2}\right)\right)^{n-k}\right] E\left[\left(-1.\iota - \frac{1}{2}\right)^k\right]. \tag{24} $$

Moments of $\tilde{\alpha} + h\left(\iota + \frac{1}{2}\right)$ are given by Sheppard's corrections (1) in the
continuous case, while the moments of $-1.\iota - \frac{1}{2}$ are given in (17). So all we
need is to replace these expressions in equation (24):

$$ a_n = \sum_{k=0}^{\lfloor n/2 \rfloor} \binom{n}{2k} \frac{1}{2k+1} \left(\frac{h}{2m}\right)^{2k} \sum_{j=0}^{n-2k} \binom{n-2k}{j} \left(2^{1-j} - 1\right) B_j\, h^j\, \tilde{a}_{n-2k-j}. $$

By a Maple procedure, it is straightforward to verify that the first eight
moments $\{a_1, \ldots, a_8\}$, computed by means of this formula, are the same as
given by Craig in [5].
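
The paper mentions a Maple check; as an independent and purely illustrative counterpart, the closed form obtained from (24) can be tested in Python on a small discrete distribution. The sketch computes the grouped moments from (19) and the raw moments from (20), then verifies that the correction formula maps the former to the latter. The dictionary representation `pmf[s]` $= \frac{h}{m} f\left(\frac{sh}{m}\right)$, the function names and the sample data are assumptions, not part of the original text.

```python
from fractions import Fraction
from math import comb


def bernoulli_numbers(n_max):
    """B_0..B_n_max with B_1 = -1/2."""
    B = [Fraction(1)]
    for n in range(1, n_max + 1):
        B.append(-sum(comb(n + 1, k) * B[k] for k in range(n)) / (n + 1))
    return B


def grouped_moments(pmf, m, h, n_max):
    """Equation (19) for n = 0..n_max."""
    return [
        sum(
            (s * h / m + h * (m - 1 - 2 * j) / (2 * m)) ** n * p
            for j in range(m)
            for s, p in pmf.items()
        ) / m
        for n in range(n_max + 1)
    ]


def raw_moments(pmf, m, h, n_max):
    """Equation (20) for n = 0..n_max."""
    return [sum((s * h / m) ** n * p for s, p in pmf.items()) for n in range(n_max + 1)]


def corrected_moments(grouped, m, h):
    """Raw moments from grouped moments via the closed form derived from (24)."""
    n_max = len(grouped) - 1
    B = bernoulli_numbers(n_max)
    c = [(Fraction(2) ** (1 - j) - 1) * B[j] for j in range(n_max + 1)]  # (14)
    out = []
    for n in range(n_max + 1):
        total = Fraction(0)
        for k in range(n // 2 + 1):
            r = n - 2 * k
            inner = sum(comb(r, j) * c[j] * h**j * grouped[r - j] for j in range(r + 1))
            total += comb(n, 2 * k) * (h / (2 * m)) ** (2 * k) / (2 * k + 1) * inner
        out.append(total)
    return out


pmf = {0: Fraction(1, 10), 1: Fraction(2, 10), 2: Fraction(3, 10), 4: Fraction(4, 10)}
m, h = 3, Fraction(3, 2)
grouped = grouped_moments(pmf, m, h, 6)
print(corrected_moments(grouped, m, h) == raw_moments(pmf, m, h, 6))
```

For the second moment the formula reduces to subtracting $\frac{h^2}{12}\left(1 - \frac{1}{m^2}\right)$ from the grouped moment, which tends to Sheppard's $h^2/12$ as $m$ grows.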



4 Corrections to multivariate grouped data 

In [10], it has been shown that the notion of multiset is the key for dealing
with multivariate moments in umbral syntax. Here we briefly recall the
notation.

Let $M$ be a multiset of umbral monomials. A multiset $M$ is a pair $(\bar{M}, f)$,
where $\bar{M}$ is a set, called the support of the multiset, and $f$ is a function from
$\bar{M}$ to the nonnegative integers. For each $\mu \in \bar{M}$, $f(\mu)$ is the multiplicity of $\mu$.
We denote a multiset $(\bar{M}, f)$ simply by $M$. When the support of $M$ is a
finite set, say $\bar{M} = \{\mu_1, \mu_2, \ldots, \mu_j\}$, we write

$$ M = \{\mu_1^{(f(\mu_1))}, \ldots, \mu_j^{(f(\mu_j))}\} \quad \text{or} \quad M = \{\underbrace{\mu_1, \ldots, \mu_1}_{f(\mu_1)}, \ldots, \underbrace{\mu_j, \ldots, \mu_j}_{f(\mu_j)}\}. $$

Set

$$ \mu_M = \prod_{\mu \in \bar{M}} \mu^{f(\mu)}. \tag{25} $$

For example, if $M = \{\mu_1^{(t_1)}, \mu_2^{(t_2)}\}$, we denote by $\mu_M$ the product $\mu_1^{t_1} \mu_2^{t_2}$.

A multivariate moment is the element of $\mathcal{R}$ corresponding to the umbral
monomial $\mu_M$ via the evaluation $E$, i.e.

$$ E[\mu_M] = m_{t_1 \ldots t_j}, \tag{26} $$

where $t_i = f(\mu_i)$ for $i = 1, 2, \ldots, j$. For example, if $M = \{\mu_1^{(t_1)}, \mu_2^{(t_2)}\}$,
we have $E[\mu_M] = m_{t_1 t_2}$. When the umbral monomials are uncorrelated,
$m_{t_1 \ldots t_j}$ becomes the product of the moments of the $\mu_i$. More details on the
meaning and the use of the symbol $\mu_M$ are given in [10].
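
In a program, a multiset with finite support maps naturally onto a counter of multiplicities. The following sketch, which assumes that umbral monomials are identified by plain string labels such as `"mu1"` (a hypothetical convention), shows the two notations for $M$, the index $(t_1, \ldots, t_j)$ of the moment in (26), and the product rule in the uncorrelated case.

```python
from collections import Counter
from fractions import Fraction


def moment_index(M, support):
    """Multiplicities (t_1, ..., t_j) of the multiset M along an ordered support."""
    return tuple(M[mu] for mu in support)


def evaluate_uncorrelated(M, moments):
    """E[mu_M] for uncorrelated monomials: the product of the moments E[mu^f(mu)]."""
    result = Fraction(1)
    for mu, t in M.items():
        result *= moments[mu][t]
    return result


# The two notations for the same multiset: multiplicities, or repeated elements.
M = Counter({"mu1": 2, "mu2": 1})
assert M == Counter(["mu1", "mu1", "mu2"])

# Moment lists E[mu^0], E[mu^1], E[mu^2] chosen arbitrarily for illustration.
moments = {"mu1": [1, 0, 1], "mu2": [1, 3, 10]}
print(moment_index(M, ["mu1", "mu2"]))    # index of the multivariate moment
print(evaluate_uncorrelated(M, moments))  # E[mu1^2] * E[mu2^1]
```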
Suppose $X = (X_1, \ldots, X_j)$ is a multivariate r.v. with joint density
function $f_X(\mathbf{x})$ over $\mathbb{R}^j$. Note that, by using the same arguments, we can
deal with any range of bounded rectangle type. As usual,

$$ m_{t_1 \ldots t_j} = \int_{\mathbb{R}^j} x_1^{t_1} \cdots x_j^{t_j} f_X(\mathbf{x})\,d\mathbf{x} \tag{27} $$

denotes the multivariate moment of $X$ of order $(t_1, \ldots, t_j)$. The moments
calculated from the grouped frequencies are given by

$$ \tilde{m}_{t_1 \ldots t_j} = \frac{1}{h_1 \cdots h_j} \int_R \int_{\mathbb{R}^j} (x_1 + z_1)^{t_1} \cdots (x_j + z_j)^{t_j} f_X(\mathbf{x})\,d\mathbf{x}\,d\mathbf{z}, \tag{28} $$

where

$$ R = \left\{ \mathbf{z} = (z_1, \ldots, z_j) \in \mathbb{R}^j : z_k \in \left(-\tfrac{1}{2} h_k, \tfrac{1}{2} h_k\right) \ \forall k \in \{1, \ldots, j\} \right\} $$

and $\{h_k\} \in \mathbb{R} \setminus \{0\}$ are the window widths for each component. A proof of
(28) can be given similarly to the one sketched for the univariate case (9).

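Theorem 4.1 (Multivariate Sheppard's correction). If the sequence $\{\tilde{m}_{t_1 \ldots t_j}\}$
in (28) is umbrally represented by the umbral monomial $\tilde{\mu}_M$, with $M$ a
multiset of finite support $\{\mu_1, \ldots, \mu_j\}$, and the sequence $\{m_{t_1 \ldots t_j}\}$ in (27) is
umbrally represented by the umbral monomial $\mu_M$, then

$$ \tilde{\mu}_M \simeq \left[\mu + h\left(-1.\iota - \frac{1}{2}\right)\right]_M, \tag{29} $$

where we set

$$ \left[\mu + h\left(-1.\iota - \frac{1}{2}\right)\right]_M = \prod_{k=1}^{j} \left[\mu_k + h_k\left(-1.\iota_k - \frac{1}{2}\right)\right]^{t_k}, $$

with $\{\iota_k\}$ uncorrelated Bernoulli umbrae.

Proof. Due to linearity, we have

$$ E\left[(\mu_1 + z_1)^{t_1} \cdots (\mu_j + z_j)^{t_j}\right] = \int_{\mathbb{R}^j} (x_1 + z_1)^{t_1} \cdots (x_j + z_j)^{t_j} f_X(\mathbf{x})\,d\mathbf{x}. $$

Moreover, from (28) we have

$$ \frac{1}{\prod_{k=1}^{j} h_k} \int_R (\mu_1 + z_1)^{t_1} \cdots (\mu_j + z_j)^{t_j}\,d\mathbf{z} \simeq \prod_{k=1}^{j} \left[\mu_k + h_k\left(-1.\iota_k - \frac{1}{2}\right)\right]^{t_k}, $$

by which the result follows. □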
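In the support of the multiset $M$, if we choose

$$ \tilde{\mu}_k = \mu_k + h_k\left(-1.\iota_k - \frac{1}{2}\right), \qquad k = 1, \ldots, j, $$

then equivalence (29) becomes an identity and so, by notation (26),

$$ \mu_M \simeq \left[\tilde{\mu} + h\left(\iota + \frac{1}{2}\right)\right]_M, \tag{30} $$

where again

$$ \left[\tilde{\mu} + h\left(\iota + \frac{1}{2}\right)\right]_M = \prod_{k=1}^{j} \left[\tilde{\mu}_k + h_k\left(\iota_k + \frac{1}{2}\right)\right]^{t_k}. $$

Equivalence (30) can be easily implemented in any symbolic package. Indeed,
in order to recover expressions of raw multivariate moments in terms
of grouped moments, we need to multiply summations like

$$ \sum_{s=0}^{n} \binom{n}{s} \tilde{\mu}_k^{s}\, h_k^{n-s} \left(2^{1-(n-s)} - 1\right) B_{n-s}, $$

corresponding to the n-th power of $\tilde{\mu}_k + h_k\left(\iota_k + \frac{1}{2}\right)$, and then replace
occurrences of products like $\tilde{\mu}_1^{s_1} \tilde{\mu}_2^{s_2} \cdots \tilde{\mu}_j^{s_j}$ with $\tilde{m}_{s_1 \ldots s_j}$.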
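
The recipe just described (multiply the univariate summations, then replace each product of powers by the corresponding grouped moment) needs no symbolic engine when the grouped moments are stored in a table. The sketch below assumes such a table, a dictionary `grouped` keyed by index tuples $(s_1, \ldots, s_j)$; the function name `multivariate_raw_moment` and the sample numbers are illustrative assumptions.

```python
from fractions import Fraction
from itertools import product
from math import comb


def bernoulli_numbers(n_max):
    """B_0..B_n_max with B_1 = -1/2."""
    B = [Fraction(1)]
    for n in range(1, n_max + 1):
        B.append(-sum(comb(n + 1, k) * B[k] for k in range(n)) / (n + 1))
    return B


def multivariate_raw_moment(t, grouped, h):
    """Raw moment m_{t_1...t_j} from grouped moments, following (30).

    t       : tuple of orders (t_1, ..., t_j)
    grouped : dict mapping (s_1, ..., s_j) to the grouped moment, for all s <= t
    h       : tuple of class widths (h_1, ..., h_j)
    """
    B = bernoulli_numbers(max(t))
    c = [(Fraction(2) ** (1 - r) - 1) * B[r] for r in range(max(t) + 1)]  # (14)
    total = Fraction(0)
    for s in product(*(range(tk + 1) for tk in t)):
        weight = Fraction(1)
        for tk, sk, hk in zip(t, s, h):
            # coefficient of the sk-th power in the tk-th power of the k-th factor
            weight *= comb(tk, sk) * Fraction(hk) ** (tk - sk) * c[tk - sk]
        total += weight * grouped[s]
    return total


# Illustrative grouped moments built as products u[s1] * v[s2] (independent components).
u, v = [1, 2, 5], [1, -1, 3]
grouped = {(s1, s2): Fraction(u[s1] * v[s2]) for s1 in range(3) for s2 in range(3)}
h = (Fraction(1, 2), Fraction(2))
print(multivariate_raw_moment((1, 1), grouped, h))  # unchanged mixed moment
print(multivariate_raw_moment((2, 2), grouped, h))
```

In this independent example the result factorizes into the product of the two univariate corrections, which gives a simple way to test the routine.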

For the sake of brevity, we skip the details of the proof of the corrections
when the multivariate parent distribution is discrete. The proof follows
the same route used for the univariate parent distribution, as we have
done for the continuous case. Here, the corrections to moments due to
grouping can be formulated in umbral terms as

$$ \mu_M \simeq \left[\tilde{\mu} + h\left(\iota + \frac{1}{2}\right) + \frac{h}{m}\left(-1.\iota - \frac{1}{2}\right)\right]_M, \tag{31} $$

where, by the symbol on the right-hand side of the previous equivalence, we
denote the following product

$$ \prod_{k=1}^{j} \left[\tilde{\mu}_k + h_k\left(\iota_k + \frac{1}{2}\right) + \frac{h_k}{m_k}\left(-1.\iota_k - \frac{1}{2}\right)\right]^{t_k}, $$

where $m_k$ is the number of consecutive values grouped in a frequency class
of width $h_k$. The multivariate version of similarity (21) is

$$ \tilde{\mu}_M \simeq \left[\mu + \frac{h}{m}\left(\iota + \frac{1}{2}\right) + h\left(-1.\iota - \frac{1}{2}\right)\right]_M. \tag{32} $$
Concluding remarks. The main goal of this paper is to show how the
corrections of moments resulting from grouping into classes may be summarized
in a few closed-form formulae. Moreover, the multivariate formulae can
be constructed from the univariate ones by a suitable indexing of umbral
monomials with a multiset.

Once more, this paper shows that the classical umbral calculus should
be taken into account for managing sequences of numbers related to r.v.'s,
since many calculations are reduced. For example, the reader interested in
recovering corrections for cumulants and factorial moments by using the
classical umbral calculus can refer to [9]. The noteworthy simplification
in the expression of corrections, when referred to cumulants instead of
moments, was first pointed out by Langdon and Ore [19] for a continuous parent
distribution over $(-\infty, \infty)$. By using the umbral syntax introduced for the
$\alpha$-cumulant umbra and the $\alpha$-factorial umbra, it is possible to recover these
corrections by a one-line proof, both for continuous and for discrete parent
distributions.

The umbral techniques applied to Sheppard's corrections open the way
to new problems that would be interesting to explore. Indeed,
the umbral version of Sheppard's corrections introduced here refers to the
averaging interpretation of these corrections, as proposed by Craig [5].
Different interpretations give rise to different forms of Sheppard's corrections,
see for example [24]. It would be interesting to see whether the umbral method
simplifies the computational apparatus and adds some new formulae also for
these different interpretations.